Rebase #13757 to master #15189

ymjiang · 2019-06-09T08:33:37Z

Description

This is a rebased version of https://github.com/apache/incubator-mxnet/pull/13757

Details

https://github.com/apache/incubator-mxnet/blob/a38278ddebfcc9459d64237086cd7977ec20c70e/example/image-classification/train_imagenet.py#L42

When I train ImageNet with this line commented out, the training accuracy reaches 99% while the validation accuracy stays below 50% (single machine, 8 GPUs, global batch size 2048, ResNet-50, fp32). This is clearly overfitting.

I then uncommented the line and reran with the same experiment settings. This time both training and validation accuracy converge to about 66%, which looks like a normal result.

Thus, this data augmentation seems to be quite important for ImageNet training. It may be better to enable it by default, so that future developers are not confused by the overfitting issue.

My commits enable data augmentation through a command-line argument.
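A minimal sketch of how a command-line switch could gate augmentation defaults, using only standard `argparse`. The flag `--use-aug`, the two option names, and the `AUG_DEFAULTS` values are hypothetical stand-ins for illustration; they are not taken from the PR diff.

```python
import argparse

# Hypothetical augmentation defaults; the real values live in set_imagenet_aug().
AUG_DEFAULTS = {"random_mirror": 1, "random_resized_crop": 1}


def build_parser():
    parser = argparse.ArgumentParser(description="augmentation switch sketch")
    # Hypothetical flag name: 0 keeps augmentation off, 1 turns it on.
    parser.add_argument("--use-aug", type=int, default=0)
    parser.add_argument("--random-mirror", type=int, default=0)
    parser.add_argument("--random-resized-crop", type=int, default=0)
    return parser


def parse(argv):
    parser = build_parser()
    # First pass: read the switch only; unknown arguments are tolerated.
    switch, _ = parser.parse_known_args(argv)
    if switch.use_aug:
        # Update the defaults before the final parse so they take effect.
        parser.set_defaults(**AUG_DEFAULTS)
    # Second pass: values given explicitly on the command line still win.
    return parser.parse_args(argv)
```

With this layout, `parse(["--use-aug", "1"])` picks up the augmentation defaults, while an explicit `--random-mirror 0` on the same command line still overrides them.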

piyushghai · 2019-06-09T18:58:01Z

@ymjiang Can you please make the PR title a bit more descriptive?
@mxnet-label-bot Add [pr-awaiting-review]

Roshrini · 2019-06-23T22:32:03Z

@ymjiang Can you please retrigger CI build?

ymjiang · 2019-06-24T05:57:13Z

@Roshrini Hi, I closed the PR and reopened it. Is that the correct way to re-trigger the CI build?

roywei · 2019-07-08T16:28:11Z

@mxnet-label-bot add [pr-awaiting-merge]

wkcn · 2019-07-11T22:19:08Z

Thanks for your contribution!

shuo-ouyang · 2021-05-28T16:27:49Z

example/image-classification/train_imagenet.py

@@ -56,6 +54,8 @@ def set_imagenet_aug(aug):
        dtype            = 'float32'
    )
    args = parser.parse_args()


Maybe we should rearrange lines 56-58? It looks like set_imagenet_aug() has no effect on args, since it is applied after the arguments are parsed.
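The ordering concern can be reproduced with plain `argparse`: defaults changed after `parse_args()` do not reach the namespace that was already returned. This is a sketch under assumptions; `set_aug` is a hypothetical stand-in for `set_imagenet_aug()`, and the option name is illustrative.

```python
import argparse


def set_aug(parser):
    # Stand-in for set_imagenet_aug(); the option name is illustrative.
    parser.set_defaults(random_mirror=1)


def parse_late(argv):
    parser = argparse.ArgumentParser()
    parser.add_argument("--random-mirror", type=int, default=0)
    args = parser.parse_args(argv)
    set_aug(parser)  # too late: args was already built from the old defaults
    return args


def parse_early(argv):
    parser = argparse.ArgumentParser()
    parser.add_argument("--random-mirror", type=int, default=0)
    set_aug(parser)  # defaults are changed before parsing
    return parser.parse_args(argv)
```

Calling `parse_late([])` leaves `random_mirror` at its original default, whereas `parse_early([])` returns the augmented default, which is why the order of the calls matters.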

ymjiang added 6 commits October 16, 2018 16:29

Update .gitmodules

5498fd2

Add argument for imagenet data augmentation

52f2945

Enable data-aug with argument

bd4faf0

Merge pull request #1 from ymjiang/patch-1

255737c

Update .gitmodules

b4b81d2

ymjiang requested a review from szha as a code owner June 9, 2019 08:33

marcoabreu added the pr-awaiting-review PR is waiting for code review label Jun 9, 2019

ymjiang closed this Jun 24, 2019

ymjiang reopened this Jun 24, 2019

marcoabreu added the pr-awaiting-merge Review and CI is complete. Ready to Merge label Jul 8, 2019

wkcn merged commit 554b196 into apache:master Jul 11, 2019

shuo-ouyang reviewed May 28, 2021
